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Abstract. Internet is a complex network composed by several networks: the 
Autonomous Systems, each one designed to transport information efficiently. 
Routing protocols aim to find paths between nodes whenever it is possible (i.e., 
the network is not partitioned), or to find paths verifying specific constraints 
(e.g., a certain QoS is required). As connectivity is a measure related to 
both of them (partitions and selected paths) this work provides a formal lower 
bound to it based on core-decomposition, under certain conditions, and low 
complexity algorithms to find it. We apply them to analyze maps obtained 
from the prominent Internet mapping projects, using the LaNet-vi open-source 
software for its visualization. 



1. Introduction 

Nowadays, Internet is a highly developed network connecting people around the 
world. Its continuous growth raises new problems, challenging us to find novel 
and creative solutions. Modeling the Internet is a problem as relevant as difficult: 
A tight model of the Internet would provide us with a powerful tool to analyze 
applications behavior or test routing algorithms during their design. In the be- 
ginnings of the Internet |Waxman, 1988| was the first to introduce an Internet 
model to test his multicast protocol. Then other authors introduced models try- 
ing to improve Waxman's one adding hierarchy and other sophistications (e.g., see 
|Zegura et al., 1997] ) . But the |Faloutsos et al., 1999] paper was the first to show a 
remarkable characteristic of the Internet's topology: the degree distribution of the 
nodes follows a heavy-tailed behavior, instead of a Poisson's one as was believed 
(i.e. this was the case of the forementioned models which are a variant of the Ran- 
dom Graph |Erdos and Renyi, 1959 ). The unexpected consequence of this is that 
we have nodes with a large variety of degrees. 

From this discovery, new simple models aroused which tried to reproduce heavy- 



tailed behavior, such as the Barabasi and Albert, 1999 model based on preferential 



attachment, and later the |Fabrikant et al., 2002 model proving that a multivariate 



optimization problem may lead to heavy-tailed degree distributions. 

Later on, other properties were highlighted as important ones, like the average 
neighboring degree distribution by |Pastor-Satorras et al., 20'0l] . This work showed 
that the Internet topology at the Autonomous System level (AS) is different from 
the Internet topology at the inter-router level (IR). On this basis, we think that 
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a better understanding and modeling of the Internet's nature may be achieved by 
discovering new significant properties. 

In this paper we turn our attention to connectivity in graphs, i.e., the num- 
ber of different independent paths between their nodes. It is quite relevant as it 
represents the robustness of the network: how tolerant it is to link failures, or 
what QoS it may offer when needed (e.g, see the RFCs of Multiprotocol Label 
Switching |Rosen et al., 2001] |Andersson and Asati, 20 09|). Also with regard to 
QoS, forwarding packets to more connected nodes helps to find paths that fulfill 
the needed QoS, therefore routing protocols like BGP Rekhter et al., 2006 might 
take advantage of information about connectivity. Here we take different subgraphs 
(i.e., portions of the topology) and analyze each one's connectivity. 

In particular, we introduce the concept of core-connected graphs. These graphs 
have the nice property that the minimum shell-index is a lower bound for the 
connectivity between two nodes. As graphs are not core-connected in general, we 
also present algorithms to find core-connected induced subgraphs. These algorithms 
are based on a generalization of Plesm'k's theorem [Plesmk, 1975| . 

Finally, it has been shown previously that the fc-core decomposition is capable of 
identifying networks sources by means of the visualization |Alvarez-Hamelin et al., 2006[ 
|Beir6 et al., 2 008 , may be used to vali date models |Serrano et al., 2006 , and dis- 
cover exploration biases on the Internet |Alvarez-Hamelin et al., 2008 . These facts 
support the fc-core analysis as one of the relevant tools to describe the Internet 
topology maps. 

The paper is organized as follows. Section [2] is devoted to formalize the relation 
between fc-cores and fc-connected subgraphs. Then in Section [3] we present a tool 
that implements these ideas giving a visualization of the network according to its 
connectivity, and also a data file containing the connectivity bound for every pair 
of nodes. We show some applications to Internet maps. Finally, we conclude with 
some important remarks and comments on future work. 



2. Mathematics 

Let us introduce some general graph notions and notation. Let G be a simple 
graph [i.e. a graph with no loops, no multiple edges) with vertex set V(G) and 
edge set E(G) (we follow in notation the book |West, 2001] ). Given A,B <Z V(G), 
[A, B] is the set of edges of the form ab, joining a vertex a £ A to a vertex b E B. 
As we consider edges without orientation, [A,I?] = [-B,A]. Abusing of notation, 
for v £ V(G),A C V(G), we write [v,A] instead of [{«},!]. The degree of a 
vertex v G V(G) is d G (v) = \[v, V(G)]\. We shall denote n(G) = \V\,e(G) = 
\E\,S(G) = min„ g y do{v), A(G) = max„ g y d<3(i>).The neighborhood of a vertex v, 
N(v), is the set of vertexes w such that vw S E(G). Given A C V(G), G(A) is 
the graph G' such that V(G') = A and E(G') is the set of edges in E{G) having 
both endpoints in A. Given v,w E V{G), pc{v,w) is the distance in G from v to 
w, that is the minimum length of a path from v to w. If v G V(G),A C V(G) we 
set pc{v,A) = imn w< z a Pg{v,w). We shall also use the notation 

PA = max : p G (A)(a,b) 

for the diameter of G(A). 

We introduce some notions of connectivity used along the paper. An edge cut 
in G is a set of edges [S, S], where S C V(G) and S == V(G) \ S are non void. 
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The edge-connectivity of G, fc'(G), is the minimum cardinal of the cuts in G. 
We say that G is fc-edge-connected if k'(G) > k. 

Menger's theorem has as a consequence that, if G is fc-edge-connected, given two 
vertices v,w in V(G) there are at least fc-edge-disjoint paths joining v to w (see 
| West, 2001| , pp.153-169). 

As we shall not deal in this work with vertex-disjoint paths, in the sequel we 
shall speak of fc-connectivity, avoiding the reference to the edges. 

2.1. Cores decomposition. Let, for A C V, 

ip(A) = mmd G{A) (v) 

Notice that for A 1 ,A 2 C V, 

i(>(Ai U A 2 ) > min(^(Ai), ip(A 2 )) 

and as a consequence for any k 

C k = U{A : ip(A) > k} 

satisfies ip(Ck) > k, and of course Ck contains any other set that satisfies this 
property. 



The preceding remark justifies the following Batagelj and Zaversnik, 2002 



Definition 1. A subgraph H = G(Gfc) induced by the set Ck C V is the k- 
core (or the core of order k) in G if Ck is the maximal subset of V such that 
mm veC d G ( C )(v) > k. 

Thus, if we let k max = maxjfc : Ck ^ 0}, we obtain the decomposition 

V = U{C fc : < k < fc max } 

where C k +i C C k , < k < fc max - 1. 

A fc-core of G can be obtained by recursively removing all the vertices of degree 
lower than k, with their incident edges, until all the vertices in the remaining graph 
have degree greater than or equal to k. This decomposition can be easily imple- 
mented: the algorithm by Batagelj and Zaversnik, 2003 has a time complexity of 
order 0{n(G) + e(G)) for a general simple graph G. This makes the algorithm very 
efficient for sparse graphs, where e(G) is of the same order that n(G). 

Definition 2. Let Sk = Ck \ Gfc+i,0 < k < fc max — 1, 5fc max = Gfc max . We call Sk 
the k-shell of G, and if v € Sk we say that v has shell-index k, sh(v) = k. 

Notice that Ck is thus the union of all shells S s with s > k, and that the shells 
are pairwisc disjoint. 

Definition 3. Every connected component of S s will be called cluster. 

Each shell S s is thus composed by clusters Q"\ such that S s — U{Q" 1 : 1 < m < 
<7max(s)}, where q max (s) is the number of clusters in S s . 

In this paper we address the following expansion problem: given a fc-edge- 
connected graph G2, give conditions under which the result of adjoining to G2 
a graph Gi will be also k edge-connected (see Corollary [2] below). 
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2.2. ^-connectivity. We may analyze connectivity in a strict or wide sense. 

Definition 4. Let A C V(G). 

(1) We say that A is k-connected in strict sense if G(A) is k-connected, i.e 
every cut in G(A) has at least k edges. That is, given u,v £ A, there exist 
at least k edge disjoint paths form u to v in G{A). 

(2) We say that A is k-connected in wide sense if every cut [X, X] in G such 
that X n A and XnA ^ has at least k edges. That is, given u,v £ A, 
there exist at least k edge disjoint paths form u to v in G. 

Of course, if A is fc-connected in strict sense, it is also fc-connected in wide sense. 

Lemma 1. Let A,B cV and An B ^ 0. Then 

(1) If A and B are k-connected in strict sense, so is AU B. 

(2) If A and B are k-connected in wide sense, so is A(J B. 

Proof. 

(1) Let u,v £ AU B. If u, v £ A or u, v £ B, then there are k disjoint paths 
in G(A) or in G(B) from u to v, and in any case there are k disjoint paths 
in G(A U B). Suppose then that u £ A, v £ B, and let [X, X] be a cut in 
G(A U B) with u £ X, v £ X. Suppose that s £ A n B and, without loss 
of generality, assume that s £ X . Then there are k disjoint paths in G(B) 
from s to v, whence \[X, X]\ > k. 

(2) Let u, v £ A U B. If u, v € A or u, v € B, then there are k disjoint paths 
in G from u to v. Suppose then that u £ A, v £ B, and let [X, X] be 
a cut with u £ X, v £ X. Suppose that s £ A n B and, without loss of 
generality, assume that s £ X . Then there are k disjoint paths in G from 
s to v, whence \ [X, X]\ > k. 

□ 

This lemma has as a consequence that given v £V, 

U{A : A is /c-conncctcd in strict sense and v £ A} 

is k connected in strict sense (and we call it the fc-connected component of v in 
strict sense), and that 

U{^4 : A is fc-connected in wide sense and v £ A} 

is k connected in wide sense (and we call it the fc-connected component of v in wide 
sense). 

Lemma 2. For fc = 1,2 the k-connected components in strict sense and in wide 
sense are the same. This is not true for k > 3. 

2.3. An expansion theorem. Let G be a simple graph. Let Q,C C V(G), and 
set C' = Q U C, G' = G(C'). We assume in the sequel that Q and C are non void 
and that Q n C = 0. We define, for x,y £ Q, the contracted distance 

Pc"/c(x,y) = m\n{p G(Q) {x,y),p G> {x,C) + p G >{y,C)} 

and for x £ C' ,y £ C 

Pc/c(x,y) = Pc/c(y,x) = PG'{X,C) 
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If x € C and A C C , we set pc> /c{ x > A) = min a eA Pc /c( x i a )- We shall use also 
the notation 

Pc/c = max pc/c{x,y) 

x,yeC 

for the contracted diameter of C . 

Notice that with these definitions, if Pc /c{ x tD) — 2 for some x,y <G C", then 
there exists z e C such that p c >/ c (x,z) — p c > /c{z,y) = 1. Notice also that 
Pc /c i s : whence the notation, the pseudo-distance induced in C" by the distance in 
the quotient graph G'/G(C), obtained from G by contracting C to a vertex (this 
quotient graph is not a simple graph). 

We shall also use the notations 

&Q = {xeQ:\[x,C}\>j} 
B j Q = {xeQ:\[x,C]\<j} = Q\d j Q 
Under these settings, we consider also 

*o/C = E min{max{l, \[x,B 2 Q}\}, \[x, C}\} 

xl£Q 

As Q and C will be fixed in the following of this Section, we shall freely omit the 
subindex C'/C when speaking of pc/c an d For v € C, N'(v) denotes its 

neighborhood in G' . 

In this general framework, we have 

Theorem 1. If pc/c < 2 7 [S, S] is an edge cut in G' such that C C S, and we let 

Si = s n Q 

(1) //max s - eg p(s,5) = 1, tfien > max s - eg \N\s)\. 

(2) //max s - e gp(s,S) = 1, then \[S,S] > \S\. 

(3) //max s - e gp(s,S) - 2, tfjen |5| > min s - e g_|Ar'(s)|. 

(4) // maxg e g p(s, S) = 2, £/ien max se s p(s, 5) = 1. 

(5) //max seSl(0 ( s ,5) = I, then \ [S u S}\ > max seSl (\N'(s)\ - \N'(s) n C\). 

(6) I/max seSl/) ( S ,S) = I, then \[S U S}\ > |Si|. 

Proof. 

(1) Suppose that for any s e S: p(s, S) = 1. Let s £ S. Then we have fci edges 
ssi, 1 < i < fci with Si e S and (eventually) fc 2 edges slj, sj € 5. But each 
Sj satisfies p(sj, S) = 1, thus we have fc 2 new edges (here we used that G' 
is simple, because we assumed that the vertices Sj are different) SjSj, with 
s'j e 5, whence 

\[S,S]\>h+k 2 = \N'(s)\ 

(2) This follows at once if we notice that in this case for each s € S there is at 
least one s £ S such that ss € [5, 5]. 

(3) Notice that if p(s, S 1 ) = 2, then {s} U iV'(s) C S. 

(4) Let s € S 1 be such that p(s~o, S 1 ) = 2. Then for each s € 5 1 , as p(s~o, s) = 2, 
there exists s' such that p(s~o, s') = p(s', s) = 1. But, again, as p(s~o, S) = 2, 
it follows that s' € 5, hence p(s, S) = 1. 

(5) For each s <G Si we have 

iv'(s) = (jv'(s) n 5) u n Si) u (n'(s) n C) 

and, if maxggSj p(s, 5) = 1, then for each s' G N'(s) n Si we have at least 
one edge in [s\ S], thus |JV'(s)| < \[S U S}\ + \(N'(s) n C)|. 
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(6) Our last statement follows noticing that max s6 Si p(s, S) = 1 means that 
for s G 5i | [a, S]\ > 1, and these sets are pairwise disjoint subsets of [Si, S]. 

□ 




(c) (d) 



Figure 1. Conventions: 1. Filled polygons represent cliques, 
and curved arcs represent edges. 2. The dotted line separates C 
(the upper set of vertices) from Q . 3. The widest arc shows 
the cut [S, S\. 4. k — min„ g Q \N'(v)\. Descriptions: (a) Here 
\[S,S}\ = 3 < k = 4, \Sf] Q\ = 2, \S\ = 5, $ = 3, S f) Q = d 2 Q. 
(b) Here \[S,S}\ = 3 < k = 4, \S n Q\ = 1, \S\ = 5, $ = 3, 
SHQ^ d 2 Q. (c) Here \[S,S}\ = 4 < k = 5, \S n Q\ = 1, \S\ = 6, 
$=3,SnQ^ 9 2 Q. (d) Here IfS 1 ,^ = 1 < k = 2, |5 n Q\ = 0, 

|5| = 3, $ = i,5ng = a 2 Q = 0. 



Corollary 1. issame t/iat in addition to the hypotheses of Theorem [7J we have 
\[S,S]\ <min^ eQ |JV'(«)|. Then 

(1) m&x seS p(s,S) = 2. 

(2) m_ax seS p(s, S) = 1. 

(3) |[5,C1|>1. 

(4) |5i| < < min^Q |iV»| < \S\. 

(5) 5ng c 9 2 Q,5d d 2 Q. 

(6) *<|[5,S]|. 

(See the examples in Figure [TJ) 

Proof. Points [T] and [2] are obvious consequences of our new hypothesis and points 
[T] and 3] in Theorem [T] To show point [31 notice that max s6 s p(s, S) = 1. The 
first (from left to right) of the inequalities stated in point 2] follows from point [5] in 
Theorem (TJ and point [3] in the present Theorem. The second of these inequalities 
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is assumed by our additional hypothesis, and the third follows immediately from 
point [3] in Theorem [T] 

From point [5] in Theorem [1] and | [C, S] | > 1 we obtain 

\[S, S]\>max seSl (\N'( S )\-\N'(s)nC\) 

Thus for s e Sx 

N'( S )>\[S,S]\>(\N'(s)\-\N'(s)nC\) 

whence \N'(s) PI C| > 2. Point [5] follows immediately from this. 
By our previous points, if s G S PI Q then 

|[s,5]| >max{l,|[ S ,a 2 Q]|} 

and of course for s € 5, |[s, S]| > |[s, C]|, thus 

\[S,S}\ = \[SDQ,S]\ + \[S,C}\ 

> max{l,|[ S ,a 2 Q]|} + ^|[ S -,C]| 

> $ 

□ 




(a) (b) 



Figure 2. Conventions: 1. Filled polygons represent cliques, 
and curved arcs represent edges. 2. The dotted line separates C (the 
upper set of vertices) from Q. 3. The widest arc shows a minimal 
cut [S,S]. Descriptions: (a) Here \[S,S]\ = k = 4, $ = 4, 
= 3. (b) Here \[S,S]\ = k = 3, $ = 1, I^QI = 1, Q = ^Q. 
This example shows that Corollary [5] includes an edge-connectivity 
version of the Expansion Lemma in W est, 2001] , Lemma 4.2.3. 

Corollary 2. Let k < S(G'), and assume that 

(1) G(C) is S(G')-edge connected 

(2) p c , /c < 2 

Then any of the following 

(1) $ > k 

(2) \d 1 Q\ > k 

(3) Q = d x Q 

implies that G is k- edge- connected. 

(See the examples in Figure O) 
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Proof. Let [S, S) be any cut in G' . We shall show that, under the listed hypotheses 
and any of the alternatives, \[S, S]\ > k. 
If S n C ^ and S n C ^ 0, then, as 

[snc,snc]c [s, s] 

is a cut in G(C), which we assumed to be fc-edge connected, we obtain \ [S, S] \ > k. 

Without loss of generality, we assume in the sequel that C C S. We argue by 
contradiction assuming that there exists some S such that \[S, S]\ < k, so that we 
are under the hypothesis of Corollary [TJ 

The first of our alternative hypothesis contradicts point [6] in the conclusions of 
Corollary [TJ 

When ugS'Q, 

mn{max{l,|M 2 Q]|},|[u,C]|} > 1 

so that we have \d 1 Q\ < <£>, i.e. the second of our alternative hypothesis implies 
the first one. 

To finish our proof, notice that if Q = d 1 Q : as S C Q, we have p(s, S) = 1 for 
any s G S, contradicting point [1] in the conclusions of Corollary [TJ □ 

Definition 5. As this Corollary will be a key for our later results, we shall set for 
future reference 

9c/c(k, G) = max{$c/c ~ k, |0 X Q| - k, ^Q] - \Q\}, for k < 6(G') 

so that the validity of some of the three last alternative hypotheses in Corollary [H 
could be re-stated as ^c'/c(k,G) > 0. 

2.3.1. Remarks. Corollary [2] is related to a well known theorem of Plesnfk (see 
|Plesmk, 1975| , Theorem 6), which states that in a simple graph of diameter 2 the 
edge connectivity is equal to the minimum degree. 

2.4. Edge-connectivity and cores decomposition. Let us introduce the fol- 
lowing 

Definition 6. Consider a graph G, its cores decomposition (see Definition^} 

V(G) = U{C fc : < k < fc max } 

and A C V{G). 

(1) We say that A is k-core connected in strict sense if AnCk is k-connected 
in strict sense. 

(2) We say that A is k-core connected in wide sense if AnCk is k-connected 
in wide sense in G{Ck). 

(3) We say that A is core connected in strict sense if AnCk is k-connected in 
strict sense for all k such that An Ck =/= 0. 

(4) We say that A is core connected in wide sense if AnCk is k-connected in 
wide sense in G(Ck) for all k such that A n Ck ^ 0- 

(5) We say that G is k-core connected when V(G) is k-core connected. 

(6) We say that G is core- connected when V{G) is core- connected. 

Next, we shall describe two algorithms that provide a mechanism to build (hope- 
fully big) core-connected sets of vertices, in strict and wide sense respectively. Both 
algorithms proceed recursively, starting from the highest core, looking for a not- 
yet-joined cluster able to be joined, joining it, and restarting from this new set of 
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vertices. The difference between the algorithms lies in the meaning of "able to be 
joined" . 

When Q is a cluster, we denote k(Q) its shell index. Let be a family of clusters, 
and let fc(0) = max{fc(Q) : Q e 0} denote the maximum shell index of the clusters 

in 0. 

Algorithm 1: strict sense core-connected 
Input: 0, the family of all clusters of the k-core decomposition of a graph G 
Output: C C V, core-connected in strict sense 
Initialization: C <— 0, k <— k max 
l begin 

2 

3 



4 
5 
6 
7 
8 
9 
10 



11 
12 
13 
14 
15 
16 



while C = and 0^0 and k > 2 do 

k <- fc(0) ; 

if there is some Q € satisfying: 

| C^CUQ; 
end 

0^0\{Qe0:fc(Q) = fc} ; 
end 

while ^ and fc > 2 do 

fc «- fc(0) ; 



while there is some Q G satisfying: 



C^- CUQ ; 

0-0\{Q} ; 

end 

0^0\{QG0:fc(Q) = fc} 



PQ <2 



then 



fc(Q) - fc 
Pcuq/c < 2 
*cu Q /c(fc,G)>0 



do 



end 



17 end 



(see Figure |3] for illustration) 

Theorem 2. Algorithm]]] always stops, and when it stops the set C is core con- 
nected in strict sense. 



Proof. 



For the first while loop, step Q] computes the maximum k for the actual 
family of clusters, while stepQ]deletes from all clusters with shell index fc. 
As a consequence, fc is strictly decreasing, and when the algorithm arrives 
to step [U either is empty or C has a cluster verifying the hypothesis of 
Corollary 

The second while loop will also finish because steps Q] and Q] assure that fc 
is strictly decreasing. 

The nested while loop will finish because the family is finite. 
Assume that the actual C is core connected in strict sense when we arrive 
to step Q] By construction, C n Cj — C when j < fc, and C n Cj are 
previous instances of C when j > k (in fact, this instances are obtained 
each time that we arrive to step [lj . The new C, let us call it C for a 
moment, has the same intersections with Cj for j > fc, and when j < k the 
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intersection is C = CUQ, that is k connected in strict sense by Corollary[2j 
as the conditions for the selection of Q in step Q] match the hypothesis of 
Corollary [21 Thus, all the instances of C during the algorithm are core 
connected in strict sense, whence the final C is core connected in strict 
sense. 

□ 

Algorithm 2: wide sense core-connected 

Input: 0, the family of all clusters of a graph G 

Output: C C V, core-connected in wide sense 

Initialization: C^0,£>^0,0'^0, fc <- k max 
l begin 

2 

3 



4 
5 
6 
7 
8 
9 
10 
11 
12 
13 



14 
15 
16 
17 



18 
19 
20 
21 
22 
23 



while C = and ^ and k > 2 do 

k <- fc(0) ; 



do 



if i/iere ?s some Q € satisfying: 

C^CUQ ; 
0^0\{Q} ; 
end 

0'^0'U{Qe0:fc(Q) = fc} ; 
0^0\{QG0:fc(Q) = fc} ; 
end 

while 0^0 and ft > 2 do 

ft <- fc(0) ; 

while £/iere zs some Q' 6 0' satisfying: 
( k(Q')>k 

\ P(CUDuQ')/(,C'UD) < 2 

*(CUDUQ')/(CUD)( fc 7 G) > 

D <- L> U Q' ; 
0'^0'\{Q'} ; 
end 

while i/iere is some Q € satisfying 
( k(Q) = k 

\ P(CUDuQ)/(C'UD) < 2 

v I'(cunuQ)/(cun)(fc, G) > 

0^0\{Q} ; 
end 

0'^0'U{Qe0:fc(Q) = fc} ; 
0^0\{Qe0:fc(Q) = fc}; 
end 



ft(Q) = k 
PQ<2 



then 



do 



24 end 



(see Figure [3] for illustration) 

Theorem 3. Algorithm^ always stops, and when it stops the set C is core con- 
nected in wide sense. 

Proof. 
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• For the first while loop, step [2] computes the maximum k for the actual 
family of clusters, while step [2] deletes from the actual all the clusters 
with shell index k. As a consequence, k is strictly decreasing, and when the 
algorithm arrives to step [2l either is empty or C has a cluster verifying 
the hypothesis of Corollary [2l 

• The second while loop will also finish because steps [2] and [2] assure that k 
is strictly decreasing. 

• The nested while loops will finish because the families 0' and are finite. 

• Assume that the actual C is core connected in wide sense when we arrive 
to step [2] By construction, C PI Cj = C when j < k, and C fl Cj are 
previous instances of C when j > k (in fact, this instances are obtained 
each time that we arrive to step [2]) . The new C, let us call it C for a 
moment, has the same intersections with Cj for j > k, and when j < k the 
intersection is C = CUQ, that is k connected in wide sense by Corollary^ 
as the conditions for the selection of Q in step [2] match the hypothesis of 
Corollary O Thus, all the instances of C during the algorithm are core 
connected in wide sense, whence the final C is core connected in wide 
sense. 

□ 

Notice that in both algorithms we have limited C to include nodes from the 
shells with k > 2. The conditions from Corollary [2] are not natural at level 1. It is 
obvious that if C is the strict sense core-connected set constructed by Algorithm [U 
then after adjoining to C all the clusters from shell 1 that are connected to it, the 
result is also strict sense core-connected. Analogously, it is clear that if C and D 
are the sets constructed by Algorithm [21 if we adjoin to C all the clusters from 
shell 1 connected to C or D, then the new C is also core-connected in wide sense. 
We assume in the following Section that C has been extended according to these 
remarks. 

3. Applications 

In this section we will test our algorithms in some Internet maps obtained from 
different sources. Each of them has its own biases and explores the Internet at a 
particular level: 

(1) The Route Views Project |ORV, 2001| , for instance, uses a short amount of 
BGP routers to peer with routers in other ASes and thus get routing tables. 
As a bias, this method does not detect hidden routes (not all inter- AS routes 
are public due to policies and agreements). 

(2) The CAIDA Association |CAIDA, 1998| developed skitter probes (which 
now evolved into the Ark infrastructure). Based on traceroute, this mea- 
surement nodes send ICMP requests to routers in order to discover paths. 
Ark has 38 monitors at November 2009. As an advantage, these tool finds 
the real routes followed by packets, advertised or not. 

(3) The DIMES Project |DIMES, 2004] is a distributed system composed of 
around one thousand voluntary nodes -anyone may subscribe even with a 
low CPU power or bandwidth-. It explores the Internet with tools such as 
traceroute and ping to discover the topology. 
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(a) Strict sense connectivity 
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(b) Wide sense connectivity 



Figure 3. These two schemata show the progress in time (t) of 
both algorithms. Dots represent clusters, circled dots clusters in 
C, squared dots clusters in D. Both algorithms start looking for a 
cluster of diameter 2, descending from the top shell. If they find 
someone, this is the initial C . Then, the first algorithm looks, 
descending shell by shell, for clusters to add to C, preserving core- 
connectivity in strict sense. The second algorithm does the same, 
but each time that it ends with the inspection of a shell, it looks 
between the omitted clusters from previous shells (saved in £}'), 
for clusters to add to D. These clusters provide more possibilities 
of connection (in wide sense) to the next shell, thus producing a 
bigger set C. 



(4) Mercator Govindan and Tangmunarunkit, 2000 uses hop- limited ICMP 



probes to discover the Internet map with an informed random heuristic. 
The algorithm chooses some nodes in the path with the IP source-routing 
option, which is no longer available. 

At the Autonomous Systems interconnection level (called AS) data has been 
obtained from the Oregon Route Views Project, the CAIDA Association and the 
DIMES Project. Router level maps (IR) come from CAIDA, DIMES and Mercator. 

To test our algorithms Q] and [2] we implemented them into the LaNet-vi open 
source software |Alvarez-Hamelin et al., 2005| . Based on the fc-core decomposition, 
this tool computes and visualizes the strict and wide core-connected components 
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giving also the lower bound of connectivity for every pair of nodes through a logging 
file. This bound is obtained applying Corollary [2] for nodes in the core-connected 
subgraph, and using the value of <& for the others. In the LaNet-vi color visualization 
(B&W differences will be mentioned in brackets), the nodes color (nodes shade) 
determines their core, while the border color (border shade) suggests a lower bound 
for connectivity with other nodes. In fact, the absence of border points out that 
the node belongs to the core-connected subgraph and so its connectivity with nodes 
in inner shells from the core-connected subgraph is at least the node shell index. 
A colored border (shaded border), instead, implies that the node is not in C but 
it belongs to the D set in algorithm [2j meaning that we assure certain level of 
edge-connectivity with internal clusters in C though this bound is lesser than the 
node shell index. Finally for white nodes (squared nodes) we can assure no edge- 
connectivity with others, and maybe the node is poorly connected. 
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Figure 4. Comparison between 7 and a. The line a = 7 splits the 
space in two areas, the upper one for graphs with C concentrated 
on high populated shells (7 > a), and the bottom for C mainly in 
low populated shells (7 < a). 



We performed different tests to show the effectiveness of our tool. Defining the 
fraction of nodes in C (the core-connected subgraph) for each shell k as: 



= \s k nc\ 
\s k \ 
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In fact, a, /3 and 7 may be interpreted as probabilities for a node to be in C under 
different models: a is the average of all pk', (3 is a weighted average of pk, where 
each shell is weighted according to its number; and 7 is a weighted average of the 
fractions pk with each fc-shell weighted by its size. 

Figure H shows 7 as a function of a. Networks with 7 > a have most part of C 
in high populated shells. We also present (3 as a function of a in figure [5l where 
(3 > a means that a lower part of C is found in lower shell indexes. 

On the one hand, AS maps are close to point (1,1) in both figures, and this 
means that most of their nodes belong to C. AS DIMES maps have (3 < a because 
some of their higher shells are empty. But on the other hand, IR maps are worse 
core-connected than AS because the ratio of nodes in C is low. Each IR has a 
different behavior; the most preferable is IR DIMES (90% of nodes in C and in 
higher shells). The reason why the other IR maps give a small C is the presence of 
big clusters in which the p < 2 condition is not satisfied. This is probably a bias in 
the exploration: IR DIMES maps may be more accurate because they use a higher 
number of sources (thousands, according to |Dall'Asta et al., 2006| ). We think that 
a detailed picture of the Internet at the IR level will verify our hypotheses. 

A different analysis was done at shell level. At first we counted pairs of nodes 
{u, v} belonging to the wide-sense core-connected graph, i.e., having both ends in 
C. Figure [S] presents this information as a function oimin{sh(u), sh{v)) for the AS 
CAIDA 2008 map. It follows that only the first three shells have nodes out of C, 
and that they are few (about 1% per shell). This behavior is similar in all maps but 
IR CAIDA 2003, where some nodes in low, medium and high shells do not belong 
to the core-connected subgraph. 

Secondly, we compared the minimum shell index of a pair of nodes {it, v} in a 
core-connected subgraph with the connectivity, this last obtained with the Gomory- 
Hu algorithm (see chapter 4 in |Ford and Fulkerson, 1962] ). In figure [7] we dis- 
play the connectivity of each pair {u, v} as a function of the minimum shell index 
min(sh(u),sh(v)), for the AS CAIDA 2008 map. We see that connectivity is higher 
than the bound obtained by the core-connected graph, still the average values are 
relatively close for low and medium shells (less than 100% up to shell 13). As 
our bound is related to connectivity through cores (i.e., taking only the maximum 
fc-core containing {u, v} to find paths) we show also the connectivity through the 
fc-core, where k = min(sh(u), sh(v)) in Figure [5] Clearly, connectivity is larger 
than connectivity through the fc-core, but this last is closer to the minimum shell 
index. The other AS maps have analogous behavior. 

As a remark, we couldn't compute connectivity for IR maps because it is expen- 
sive on RAM memory for their size, which is greather than 100,000 nodes. 
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Figure 6. Histogram showing the number of paths from u to v in 
the AS CAIDA 2008 map. white is for {it, v} G C and black is for 
u £ C or v € C. 



To conclude our analysis, we present in figure [9] a color visualization for AS 
CAIDA 2008 map (greyscale is shown in figure [T0"|). testing the algorithm [2] 
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Figure 7. Connectivity for {u,v} pairs vs. their minimum shell 
index. Circles stand for mean values per shell index and error 
bars show the standard deviation; crosses show connectivity; the 
bounding line connectivity — shell index is also displayed. 
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Figure 8. Connectivity through the cores for {it, v} pairs vs. their 
minimum shell index. Circles stand for mean values and error 
bars show the standard deviation; crosses show connectivity; the 
bounding line connectivity — shell index is also displayed. 
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Figure 9. Color visualization of AS CAIDA 2008 map displaying 
the wide-sense core-connected subgraph (algorithm [2]). Bottom: a 
detail showing some nodes out of the core-connected subgraph, and 
some in white (out of C and D); we can see one node belonging to 
D on the right. 



4. Conclusions 

In this work we have defined core-connected graphs, for which we obtain a lower 
bound of connectivity between nodes: the minimum shell index of them in the graph 
fc-core decomposition. To formalize this relation we provided a theorem (Corollary 
2) that gives sufficient conditions to assure the forementioned lower bound. This 
theorem is an extension of Plesnik's theorem (see |Plesmk, 1975] ) : 

Plesnik's theorem asserts that if a simple graph has diameter 2, then the con- 
nectivity is at least 5(G). It is not difficult to show that the hypotheses of G 
being simple can be easily relaxed, thus obtaining that the connectivity is at 
least min^gy |./V(v)| (which is 5(G) in the simple case). The combinatorial na- 
ture of connectivity, where multiple bifurcations give rise to a great multiplicity of 
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Figure 10. Greyscale visualization of AS CAIDA 2008 map dis- 
playing the wide-sense core-connected subgraph (algorithm [2]). 
Bottom: a detail showing some nodes out of the core-connected 
subgraph, and some in white (out of C and D); we can see one 
node belonging to D on the right. 



paths between two vertices, in the absence of bottle necks, justifies the presump- 
tion that the diameter 2 condition is rather artificial. In fact, |Carmi et al., 200"6 
Carmi et al., 2007] have noticed that the node to node connectivity k'(u,v) is at 
least mm(sh(u), sh(v)) for all but very exceptional pairs u,v in many real life net 
graphs (indeed, the connectivity in this paper is less than k 1 , because the authors 
count only disjoint paths, where disjoint means that they do not share neither edges 
nor vertices). Our results herein show some semilocal conditions under which this 
bound for the connectivity holds, where semilocal means here that the conditions 
involve, for each k, the relations between a fc-core and his next (k— l)-shell. Rather 
that an alternative procedure to find the connectivity, which can be rather efficiently 
found with the Gomory-Hu algorithm (see chapter 4 of |Ford and Fulkcrs on, 1962| ), 
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we hope that our results give some new insight on the local-global relations for con- 
nectivity, useful in real life net graphs. 

We also developed two algorithms to get core-connected subgraphs of a given 
graph G: one for strict-sense connectivity (whose complexity is 0(e)) and one for 
wide-sense connectivity (0(e x y/e)). In the strict-sense algorithm each cluster is 
considered only once to determine if it fulfills conditions about its diameter and 
Being e(Q) the amount of edges of cluster Q counting the internal ones and the 
ones that connect it with inner cores (clusters don't share edges with other clusters 
in the same core), then in 0(e(Q)) it can be determined if the cluster's diameter is 
less or equal to 2. The $ condition can also be evaluated in 0(e(Q j) as it implies a 
BFS in the cluster. Consequently, covering all clusters, the algorithm runs in 0(e). 
In the wide-sense algorithm each cluster may be considered up to k max times (once 
per each shell). But k max is bounded above by y/e because to obtain a fc-shell, 
k + 1 nodes are needed at least with k connections each, so the graph must have 
k x (k — 1) edges. Then we get a complexity of 0(e x y/e). 

Finally we included these algorithms in the open-source software LaNet-vi to 
visualize core-connected subgraphs and list nodes in them, showing that it works 
for the Internet maps. 

We are working on a possible relaxation on the conditions involved with $ and 
the treatment of clusters with p > 2. We will also look for explanations on the high 
connectivity found in the higher shells. 
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